The PAPILLON Project: Cooperatively Building a Multilingual Lexical Data-base to Derive Open Source Dictionaries & Lexicons
نویسندگان
چکیده
The PAPILLON project aims at creating a cooperative, free, permanent, web-oriented and personalizable environment for the development and the consultation of a multilingual lexical database. The initial motivation is the lack of dictionaries, both for humans and machines, between French and many Asian languages. In particular, although there are large F-J paper usage dictionaries, they are usable only by Japanese literates, as they never contain both original (kanji/kana) and romaji writing. This applies as well to Thai, Vietnamese, Lao, etc.
منابع مشابه
The PAPILLON Project: Cooperatively Building A Multilingual Lexical Data-Base To Derive Open Source Dictionaries And Lexicons
The PAPILLON project aims at creating a cooperative, free, permanent, web-oriented and personalizable environment for the development and the consultation of a multilingual lexical database. The initial motivation is the lack of dictionaries, both for humans and machines, between French and many Asian languages. In particular, although there are large F-J paper usage dictionaries, they are usab...
متن کاملPapillon Lexical Database Project: Monolingual Dictionaries & Interlingual Links
This paper presents a new research and development project called Papillon. It started as a French-Japanese cooperation between laboratories GETA/CLIPS (Grenoble, France) and NII (Tokyo, Japan). Its goal is to build a multilingual lexical database and to extract from it digital bilingual dictionaries. The database is built with monolingual dictionaries, one for each language of the database, li...
متن کاملThe translation of examples , citations , definitions and glosses in the Papillon project
The Papillon lexical data base comprises a set of detailed monolingual dictionaries of « lexies » (word senses) interlinked through « axies » (interlingual links) which can also refer to external semanticsoriented systems such as UNL « universal words », Worldnet « synsets », Ontos « concepts » or NTT ALT/JE system « semantic classes ». The basic idea is that bilingual or multitarget usage dict...
متن کاملData Repository Organization and Recuperation Process for Multilingual Lexical Databases1
This paper describes the data management in multilingual lexical databases. Since NLP systems are using lexical data, the amount of work to build them is huge. That is the reason why it is important to use rigorous powerful systems and to be able to get data for a minimal cost. As an open source project, Papillon is reusing existing lexical data and wants to make volunteers collaborate. To make...
متن کاملCreating a Multilingual Data Collection for Bilingual Lexicography from Parallel Monolingual Lexicons
In the DELIS project 1 , a set of parallel monolingual lexicon fragments have been designed. They are parallel in two ways: (1) they cover the same fragment (the most general verbs of sensory perception and of speech act), and (2) they are based on the same theoretical approaches and on comparable classiications and descriptive devices. It is claimed in this paper that such parallel fragments, ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002